Integrating INQUERY with an RDBMS to Support Text Retrieval

نویسندگان

  • S. R. Vasanthakumar
  • James P. Callan
  • W. Bruce Croft
چکیده

Information is a combination of structured data and unstructured data. Traditionally, relational database management systems (RDBMS) have been designed to handle structured data. IR systems can handle text (unstructured data) very well but are not designed to handle structured data. With present day information being a combination of structured and unstructured data, there is an increasing demand for an IR-DBMS system that incorporates features of both IR and DBMSs. We discuss a framework that incorporates powerful text retrieval in relational database management systems. An extended SQL with probabilistic operators for text retrieval is deened. This paper also discusses an implementation of the probabilistic operators in SQL.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The INQUERY Retrieval System

As larger and more heterogeneous text databases become available, information retrieval research will depend on the development of powerful, eecient and exible retrieval engines. In this paper , we describe a retrieval system (IN-QUERY) that is based on a probabilis-tic retrieval model and provides support for sophisticated indexing and complex query formulation. INQUERY has been used successfu...

متن کامل

Supporting Full-Text Information Retrieval with a Persistent Object Store

The inverted file index common to many full-text information retrieval systems presents unusual and challenging data management requirements. These requirements are usually met with custom data management software. Rather than build this custom software, we would prefer to use an existing database management system. Attempts to do this with traditional (e.g., relational) database management sys...

متن کامل

New Tools and Old Habits: The Interactive Searching Behavior of Expert Online Searches using INQUERY

We present data that describe the interactive searching behavior of ten searchers using the INQUERY retrieval engine in the context of the TREC routing task We dis cuss how these searchers with a strong background in the use of traditional online retrieval mechanisms adapted after very limited training to the use of a best match ranked output full text retrieval mechanism

متن کامل

Parallel, Platform-Independent Implementation of Information Retrieval Algorithms

The relational platform provides a flexible, low maintenance environment for integrating searches of structured and unstructured data. We present relational algebra for the Information Retrieval problem and SQL for leading probabilistic retrieval approaches. We tested 150 standard Text Retrieval Evaluation Conference queries against a collection of half a million documents. Because of the paral...

متن کامل

Text Joins for Data Cleansing and Integration in an RDBMS

An organization’s data records are often noisy because of transcription errors, incomplete information, lack of standard formats for textual data or combinations thereof. A fundamental task in a data cleaning system is matching textual attributes that refer to the same entity (e.g., organization name or address). This matching can be effectively performed via the cosine similarity metric from t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Data Eng. Bull.

دوره 19  شماره 

صفحات  -

تاریخ انتشار 1996